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DETAILED ACTION 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
December 14, 2009 has been entered. 



Election/Restrictions 

2. Newly submitted claim 28 is directed to an invention that is independent or 
distinct from the invention originally claimed for the following reasons: Independent 
claims 1 and 16 are directed to a method and apparatus, respectfully, for classifying 
audio tracks. Newly added independent claim 28 is directed to a portable audio 
playback device and user interface. The independent claims do not share any common 
limitations. 

Since applicant has received an action on the merits for the originally presented 
invention, this invention has been constructively elected by original presentation for 
prosecution on the merits. Accordingly, claim 28 is withdrawn from consideration as 
being directed to a non-elected invention. See 37 CFR 1 .142(b) and MPEP § 821 .03. 



Claim Rejections - 35 USC §112 

3. The following is a quotation of the first paragraph of 35 U.S. C. 112: 
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The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

4. Claims 20, 22, and 23 are rejected under 35 U.S.C. 112, first paragraph, as 
failing to comply with the written description requirement. The claim(s) contains subject 
matter which was not described in the specification in such a way as to reasonably 
convey to one skilled in the relevant art that the inventor(s), at the time the application 
was filed, had possession of the claimed invention. Claims 20 and 23 recite the 
limitation "being a neighbour of the second cluster" however the term "neighbour" does 
not appear in the specification as originally filed. Further claim 22 recites the limitation 
"wherein the defined minimum level depends on the number of tracks in existing 
clusters" however after careful review of the specification as originally filed as 
relationship between the "defined minimum level" and "the number of tracks in other 
existing clusters" could not be found. Appropriate correction and/or clarification is 
required. 

5. The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

6. Claims 20 and 23 are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. The term "neighbour" in claims 20 and 23 is a 
relative term which renders the claim indefinite. The term "neighbour" is not defined by 
the claim, the specification does not provide a standard for ascertaining the requisite 
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degree, and one of ordinary skill in the art would not be reasonably apprised of the 
scope of the invention. 

Claim Rejections - 35 USC § 103 

7. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

8. Claims 1, 2, 4 - 10, 12, 13, 16-20, 23, and 26 are rejected under 35 U.S.C. 
103(a) as being unpatentable over Obrador (US 7,149,755 B2), hereinafter Obrador , In 
view of Khan et al. (US 7,277,766), hereinafter Khan , and Liou et al. (US 6,278,446), 
hereinafter Liou . 

Claim 1 : Obrador discloses a method for creating or accessing a menu for audio 
content stored in a storage means, the content consisting of audio tracks, and the menu 
containing representations of said audio tracks, the method comprising: 

classifying ("organized") the audio tracks ("As used herein, the term "media 
object" refers broadly to any form of digital content, including text, audio, graphics, 
animated graphics and full-motion video," Column 3 Lines 55 - 58 also "digital content 
may be compressed using a compression format that is selected based upon the digital 
content type (e.g., an MP3 or a WMA compression format for audio works," Column 4 
Lines 3-6) into groups or clusters (see "Browsing a Media Object Cluster Hierarchy," 
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Column 9), wherein said classification is performed according to characteristic 
parameters of said audio tracks ("The metadata similarity may correspond to low-level 
features (e.g., motion activity, texture or color content, and audio content) or high-level 
features (e.g., meta data, such as keywords and names; objects, such as persons, 
places and structures; and time-related information, such as playback length and media 
object creation date). One or more known media object processing techniques (e.g., 
pattern recognition techniques, voice recognition techniques," Column 9 Lines 53 - 67); 

detecting addition of a new audio track ("As these collection grow in number and 
diversity, individuals and organizations increasingly will require systems and methods 
for organizing and browsing the digital content of their collections," Column 1 Lines 1 8 - 
21 , and therefore the system must detect new audio tracks in order to organize the 
growing collection.); 

determining characteristic parameters of the new audio track ("metadata 
similarity criteria"). 

Obrador does not disclose wherein said characteristic parameters comprise 
physical features, perceptual features, and psychological features, wherein, physical 
features comprise one or more of spectral centroid, short-time energy, or short-time 
average zero-crossing, and wherein perceptual features comprise one or more of 
rhythm and tonality. Obrador does state with reference to browsing and organizing 
media, "In some embodiments, the relevance criteria used to select the media objects 
that will be presented contemporaneously with the selected media file may relate to a 
selected metadata similarity between media objects and the selected media file. The 
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metadata similarity may correspond to low-level features (e.g., motion activity, texture or 
color content, and audio content) or high-level features (e.g., meta data, such as 
keywords and names; objects, such as persons, places and structures; and time-related 
information, such as playback length and media object creation date). One or more 
known media object processing techniques (e.g., pattern recognition techniques, voice 
recognition techniques, color histogram-based techniques, and automatic pan/zoom 
motion characterization processing techniques) may be used to compare media objects 
to the selected media file in accordance with the selected metadata similarity criterion," 
Column 9 Lines 48 - 67. Khan discloses a method and system for analyzing digital 
audio files. Khan teaches, "One advantage of the foregoing aspects of the present 
invention is that unique audio signatures may be assigned to audio files. Also various 
attributes may be tagged to audio files. The present invention can generate a 
customized playlist for a user based upon audio file content and the attached attributes. 
Hence making the music searching experience easy and customized," Column 3 Lines 
24 - 30. "Some of the features that can be associated with the audio files are: (a) 
Emotional quality vector values that indicates whether an audio file content is Intense, 
Happy, Sad Mellow, Romantic, Heartbreaking, Aggressive or Upbeat, (b) Vocal vector 
values that indicates whether the audio file content includes a Sexy voice, a Smooth 
voice, a Powerful voice, a Great voice, or a Soulful voice, (c) Sound quality vector 
values that indicates whether the audio file content includes a strong beat, is simple, 
has a good groove, is fast, is speech like or emphasizes a melody, (d) Situational 
quality vector values that indicate whether the audio file content is good for a workout, a 
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shopping mall, a dinner party, a dance party, slow dancing or studying, (e) Ensemble 
vector values indicating whether the audio file includes a female solo, male solo, female 
duet, male duet, mined duet, female group, male group or instrumental, (f) Genre vector 
values that indicate whether the audio file content belongs to a plurality of genres 
including Alternative, Blues, County, Electronics/Dance, Folk, Gospel, Jazz, Latin, New 
Age, Rhythm and Blues (R and B), Soul, Rap, Hip-Hop, Reggae, Rock and others, (g) 
Instrument vectors that indicates whether the audio file content includes an acoustic 
guitar, electric guitar, bass, drum, harmonica, organ, piano, synthesizer, horn or 
saxophone," Column 7 Lines 19-45. Khan continues, "As discussed in step S901, 
certain features or parameters are extracted from an audio file signal. The features of 
this methodology are based on Short Time Fourier Transform (STFT) analysis," Column 
8 Lines 56 - 60. The following STFT-based features may be extracted in step S901 : 
Spectral Centroid, Spectral Rolloff, Spectral Flux, Peak Ratio, Subband energy vector, 
Subband flux, and Subband Energy Ratios, Column 9 Lines 12-56. Therefore, since 
Obrador suggests using various analysis techniques to capture various features of 
audio content, it would have been obvious to one of ordinary skill in the art at the time of 
the invention to use the well known digital audio analysis techniques as disclosed by 
Khan to capture the various features of the digital audio content, thereby realizing the 
aforementioned advantages. 

Obrador and Khan further disclose selecting automatically a first audio track as 
being a representative for the cluster, wherein the medoid of the cluster is selected ("For 
example, the media objects may be ordered in accordance with a selected context 
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criterion, and the representative media object may correspond to the centroid or some 
other statistically-weighted average of a selected cluster of the ordered media objects," 
Obrador Column 10 Lines 35 - 39); 

automatically generating a reproducible audio extract from said representative 
audio track; and associating said audio extract as representative of said cluster to a 
menu list ("Media objects 98 may be indexed with logical links into the set of data 
structure sequences, as shown in Fig. 8A. Each data structures sequence link into a 
media file may be identify a starting point in the media file and the length of the 
corresponding sequence," Obrador Column 7 Lines 46 - 50 also "The media file and the 
media objects preferably are presented to the user through multimedia album page, 
which is a windows-based GUI that is displayed on a display monitor 42 (Fig. 2)," 
Obrador Column 8 Lines 3 - 7). 

While Obrador and Khan must be able to detect and determine characteristic 
parameters of new audio tracks in order to organize and add to the growing collection, 
Obrador and Khan do not disclose the specifics of clustering the newly added track thus 
one of ordinary skill in the art at the time of the invention would be motivated to look 
elsewhere for such a teaching since a teaching of how new tracks are added is 
necessary for organizing a dynamic database as taught by Obrador and Khan . As a 
result of the missing teachings, Obrador and Khan are silent to the claimed limitations 
regarding, determining that dissimilarity between the newly added track and existing 
clusters, according to said characteristic parameters used for classification, reaches at 
least a defined minimum level; upon said determining, automatically creating a new 
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cluster; assigning the new audio track to said new cluster, second cluster; upon said 
creating the second cluster, classifying one or more further audio tracks of said audio 
tracks into the second cluster. 

Liou discloses a method for organization and browsing of media similar to 
Obrador . Liou further teaches in the art of clustering with regards to organizing media 
and in particular adding new media," The preferred shot [in the case of Obrador and 
Khan , the "shot" would refer to the "audio track"] grouping method is based on nearest 
neighbor classification, combined with a threshold criterion. This method satisfies the 
constraints discussed above, where no a priori knowledge or model is used. The initial 
clusters are generated based on the color feature vector [in the case of Obrador and 
Khan , the "color feature vector" would refer to the "audio feature vector"] of the shots 
[audio tracks]. Each initial cluster is specified by a feature vector which is the mean of 
the color feature vectors [audio feature vectors] of its members. When a new shot 
[audio track] is available, the city block distance between its color feature vector [audio 
feature vector] and the means or feature vectors of the existing clusters is computed. 
The new shot [audio track] is grouped into the cluster with the minimum distance from 
its feature vector, provided the minimum distance is less than a threshold. If an existing 
cluster is found for the new shot [audio track], the mean (feature vector) of the cluster is 
updated to include the feature vector of the new shot [audio track]. Otherwise, a new 
cluster is created with the feature vector of the new shot [audio track] as its mean. The 
threshold is selected based on the percentage of the image pixels [e.g., audio samples] 
that need to match in color [audio feature], in order to call two images [audio tracks] 
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similar," Column 10 Lines 35 - 53 and Figure 11. Liou further states, "Other features 
may also be used to produce the clusters, including audio similarity," Column 1 1 Lines 1 
-5. 

Therefore, as seen above as indicated by the bracketed text added by the 
Examiner, it would have been obvious to one of ordinary skill in the art of clustering at 
the time of the invention to incorporate the teachings of Liou regarding the claimed 
determining that dissimilarity between the newly added track and existing clusters, 
according to said characteristic parameters used for classification, reaches at least a 
defined minimum level; upon said determining, automatically creating a new cluster; 
assigning the new audio track to said new cluster, classifying the audio tracks into the 
groups or clusters, including the second cluster, thus motivating one of ordinary skill in 
the art to look elsewhere for such a teaching in order to realize the invention, in the 
invention of Qbrador and Khan thereby providing a teaching for adding new audio tracks 
in a suitable manner in order to organize the growing collection of Qbrador and Khan , 
since as stated by Liou , "Other features may also be used to produce the clusters, 
including audio similarity," Column 1 1 Lines 1 - 5. 

Claim 2: Qbrador , Khan , and Liou disclose the method according to claim 1 , wherein 
said characteristic parameters used for classification of audio content comprise one or 
more audio descriptors, the audio descriptors being either physical features, or 
perceptual features, or psychological or social features of the audio content ("The 
metadata similarity may correspond to low-level features (e.g., motion activity, texture or 
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color content, and audio content) or high-level features (e.g., meta data, such as 
keywords and names; objects, such as persons, places and structures; and time-related 
information, such as playback length and media object creation date). One or more 
known media object processing techniques (e.g., pattern recognition techniques, voice 
recognition techniques," Obrador Column 9 Lines 53 - 67) 

Claim 4: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein 
the audio tracks within a cluster have variable order, so that the user listens to a 
randomly selected track when having selected a cluster, with said track belonging to 
said cluster (variable based on similarity, Obrador ). 

Claim 5: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein a 
user can modify the result of automatic classification of audio tracks (e.g., by choosing a 
different anchor, Obrador ). 

Claim 6: Obrador . Khan , and Liou disclose the method according to claim 1 , wherein a 
user can modify the classification rules for automatic classification of audio tracks (e.g., 
by choosing a different anchor, Obrador ). 

Claim 7: Obrador . Khan , and Liou disclose the method according to claim 1 , wherein 
the actual audio data are clustered within said storage means according to said menu 
("The media file and the media objects preferably are presented to the user through 
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multimedia album page, which is a windows-based GUI that is displayed on a display 
monitor 42 (Fig. 2)," Obrador Column 8 Lines 3 - 7). 

Claim 8: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein 
the audio extract is a sample from the audio track ("Media objects 98 may be indexed 
with logical links into the set of data structure sequences, as shown in Fig. 8A. Each 
data structures sequence link into a media file may be identify a starting point in the 
media file and the length of the corresponding sequence," Obrador Column 7 Lines 46 - 
50). 

Claim 9: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein 
audio extracts are created additionally for audio tracks not being representatives of 
clusters ("Media objects 98 may be indexed with logical links into the set of data 
structure sequences, as shown in Fig. 8A. Each data structures sequence link into a 
media file may be identify a starting point in the media file and the length of the 
corresponding sequence," Obrador Column 7 Lines 46 - 50). 

Claim 10: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein 
the length of audio extracts is not predetermined ("Media objects 98 may be indexed 
with logical links into the set of data structure sequences, as shown in Fig. 8A. Each 
data structures sequence link into a media file may be identify a starting point in the 
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media file and the length of the corresponding sequence," Obrador Column 7 Lines 46 - 
50. 

Claim 12: Obrador , Khan , and Liou disclose the method according to claim 1, wherein 
said menu is hierarchical, such that a cluster may contain one or more subclusters (see 
"Browsing a Media Object Cluster Hierarchy," Obrador Column 9). 

Claim 13: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein 
the classification rules are modified automatically if a defined precondition is detected, 
and a reclassification may be performed (e.g., by choosing a different anchor, Obrador ). 

Claim 20: Obrador . Khan , and Liou disclose the method according to claim 1 , wherein 
said one or more further audio tracks classified into the second cluster were previously 
classified in a first cluster being a neighbour of the second cluster ("Delete member from 
cluster and place in new cluster recomputed cluster mean," step L Figure 1 1 of Liu), the 
method further comprising the steps of: 

selecting automatically a second audio track being a representative for the first 
cluster, wherein the medoid of the new cluster is selected (Since "objects are grouped 
into clusters, each of which preferably contains a fixed number of media objects," there 
must be the creation of new clusters when the collection grows in number and diversity, 
and therefore the system selects a media object corresponding to "the centroid or some 
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other statistically-weighted average of a selected cluster of the ordered media objects," 
Obrador Column 10 Lines 18 - 39); 

automatically generating a reproducible new audio extract from the second audio 
track; and associating said new audio extract of the second audio track as 
representative of the first cluster to the menu list ("Media objects 98 may be indexed 
with logical links into the set of data structure sequences, as shown in Fig. 8A. Each 
data structures sequence link into a media file may be identify a starting point in the 
media file and the length of the corresponding sequence," Obrador Column 7 Lines 46 - 
50 also "The media file and the media objects preferably are presented to the user 
through multimedia album page, which is a windows-based GUI that is displayed on a 
display monitor 42 (Fig. 2)," Obrador Column 8 Lines 3 - 7). 

Claim 26: Obrador , Khan , and Liou disclose the method according to claim 20, wherein 
another audio track was a representative of the first cluster before the new audio track 
was added, and said first audio track being representative of the first cluster is different 
from the other audio track that was representative of the first cluster before the new 
audio track was added ("For example, the media objects may be ordered in accordance 
with a selected context criterion, and the representative media object may correspond to 
the centroid or some other statistically-weighted average of a selected cluster of the 
ordered media objects," Obrador Column 10 Lines 35 - 39, and "Update Cluster Mean", 
"Recompute Cluster Mean", Liou Figure 1 1 ). 



Application/Control Number: 1 0/541 ,577 Page 1 5 

Art Unit: 2614 

Claims 16 - 18 are substantially similar in scope to claim 1 and is also disclosed in 
Figure 2, and therefore is rejected for the same reasons as claim 1 with addition of 
Figure 2. 

Claim 19: Obrador , Khan , and Liou disclose the method according to claim 1 , wherein 
the audio extract is an audio sequence being synthesized from the actual audio track 
rather than being an original sample ("Media objects 98 may be indexed with logical 
links into the set of data structure sequences, as shown in Fig. 8A. Each data structures 
sequence link into a media file may be identify a starting point in the media file and the 
length of the corresponding sequence. The data structure sequences may be 
consecutive, as shown in FIG. 8B, or non-consecutive," Obrador Column 7 Lines 46 - 
55, and therefore a non-original sample sequence. 

Claim 23 is substantially similar in scope to claim 20 and therefore is rejected for the 
same reasons. 

9. Claims 3, 1 1 , and 27 are rejected under 35 U.S.C. 1 03(a) as being unpatentable 
over Obrador , Khan , and Liou in view of Piatt (US 6,987,221 ), hereinafter Piatt. 

Claim 3: Obrador , Khan , and Liou disclose the method according to claim 1 , but do not 
disclose whether or not an audio track can be classified into more than one cluster. Piatt 
discloses a similar clustering technique for audio and while not explicitly stated teaches, 
the tracks are placed in the playlist based upon the results of a vector which is based 
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upon multiple attributes of the item (Column 10 Lines 9 - 48). Therefore, it would have 
been obvious to one of ordinary skill in the art that when generating multiple playlists as 
disclosed by Piatt that the system of Piatt may decide that a song may have the 
minimum required attributes necessary to match more than one playlist category and 
therefore be classified in more than one playlist. Since excluding songs from being in 
more than one playlist would be disadvantages to the user, since the user wants the 
best matching songs in each playlist. Therefore, when applying a similar technique in 
Obrador , Khan , and Liou , it would have been obvious to one of ordinary skill in the art at 
the time of the invention to generate clusters in a similar manner. 

Claim 27: Obrador , Khan , Liou , and Piatt disclose the method according to claim 3, 
wherein a track is classified into two clusters and both clusters contain a link to said 
track ("link", "pointer", Obrador Column 6 Lines 39 - 47) and wherein the track is stored 
only once ("all media files in a selected collection are stored only once in data base 96 
(FIG. 7B)," Obrador Column 7 Lines 40 - 45). 

Claim 1 1 : Obrador , Khan , and Liou disclose the method according to claim 1 , but do 
disclose wherein one of said clusters has no representative track. Piatt discloses a 
similar clustering technique for audio and while not explicitly stated teaches how to 
determine the order among seed items when more than one seed item is selected. And 
therefore while one of ordinary skill in the art may consider any one of the seed items in 
this case to be the representative track, it would also have been obvious to one of 
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ordinary skill in the art at the time of the invention that a representative track does not 
exist since a determination cannot be made among seed items. Therefore, when 
applying a similar technique in Qbrador , Khan , and Liou, it would have been obvious to 
one of ordinary skill in the art at the time of the invention to generate clusters in a similar 
manner. 

10. Claims 14 and 15 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Qbrador , Khan , and Liou in view of Mercer et al. (US 7,043,477), hereinafter 
Mercer . 

Claims 14 and 15: Qbrador discloses the method according to claim 13, but do not 
disclose wherein said precondition comprises that the difference between the number of 
tracks in a cluster and the number of tracks in another cluster reaches a maximum limit 
value, and wherein said precondition comprises that all stored tracks were classified 
into one cluster, and the total number of tracks reaches a maximum limit value. Mercer 
discloses where bounds are set when determining the size of playlists (Column 8 Line 
40 - Column 9 Line 62). Therefore, it would have been obvious to one of ordinary skill in 
the art given the teaching of Mercer to incorporate a limit between two playlists or a 
single sequence in the invention of Qbrador , Khan , and Liou to determine how 
classification is performed, thereby allowing for example "If composer information is 
available for some of the selected media files (e.g., "if greater than twenty-five percent), 



Application/Control Number: 1 0/541 ,577 Page 1 8 

Art Unit: 2614 

the authoring software creates a menu 'Composer' ..." thereby further automating the 
classification process, Mercer Column 9 Lines 22 - 27. 

1 1 . Claims 22 and 24 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Obrador , Khan , and Liou in view of Robinson (US 7,072,846 B1 ), hereinafter 
Robinson, with further support from Ferhatosmanoglu et al. (Approximate Nearest 
Neighbor Searching in Multimedia Databases), hereinafter Ferhatosmanoglu . 

Claim 22: Obrador , Khan , Liou but do not disclose the method according to claim 1 but 
do not disclose wherein the defined minimum depends on the number of tracks in other 
existing clusters. Robinson discloses a similar method and system for clustering songs 
and recommending the best song in the cluster to the user, Column 1 3 Lines 46 - 54. 
Robinson also teaches setting "the average number of songs desired per cluster," 
Column 4 Lines 65 - 67, similar to Qbrador's teaching of fixing the number of media 
objects in a cluster. Robinson further explains, "As new songs are added to the system, 
new clusters are automatically created such that the average number of songs remains 
approximately the same; the optimization process then populates the cluster. These 
clusters, in various embodiments, may start out empty before they are optimized, or 
may be initially populated with new songs or randomly chosen songs," Column 4 Line 
67 - Column 5 Line 7. Robinson also teaches that a wide range of clustering 
approaches fall within the scope of the invention and gives provides source code for the 
standard k-means clustering concept as an example. To further support the technique 
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of Robinson , Ferhatosmanoglu teaches the "k-means algorithm [13] iteratively 
constructs a number of clusters with a representative for each cluster such that the error 
in representation is minimized," Page 506 Column 2. Ferhatosmanoglu like Obrador and 
Robinson also teaches the clustering algorithm limits "the size of each cluster from both 
above and below," Page 507 Column 1 . Ferhatosmanoglu explains, "If the size goes 
above the upper threshold, the cluster is split into two. If the size goes down below the 
lower threshold, then the cluster centroid is erased from the list of centroids. To split a 
cluster, we first duplicate the cluster centroid, and then perturb the exact copies 
randomly. It is known that K-means algorithm is sensitive to initialization. Since we have 
this splitting mechanism, instead of starting from cluster centroids chosen by some pre- 
processing scheme, we start by a single cluster, and the algorithm automatically creates 
new clusters until the population of each cluster is below the threshold. As we will 
demonstrate later, by having a lower threshold for cluster size, several queries can be 
answered by retrieving only a very small number of clusters. Also, by limiting the cluster 
sizes from above, we avoid extremely unbalanced distribution of data over the clusters. 
Although the minimum and maximum cluster sizes are not dominant factors in the 
performance of our technique, reasonable values need to be set for the design 
purposes. Therefore, given the teachings of Robinson and Ferhatosmanoglu , it would 
have been obvious to one of ordinary skill in the art at the time of the invention to use 
the k-means algorithm as suggested by Robinson and further explained by 
Ferhatosmanoglu with limits placed on the size of the clusters when adding new songs 
to a collection in the invention of Obrador , Khan , and Liou, thereby realizing the 
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aforementioned advantages while fixing the number of media objects in a cluster that 
may be conveniently presented to a user at the same time (Obrador Column 10 Lines 
23-27). 

Claim 24: Obrador , Khan , and Liou disclose the apparatus according to claim 16 but do 
not disclose wherein the means for assigning one or more of the audio tracks of said 
first cluster to the new second cluster to the new cluster uses the K-means algorithm to 
decide which audio tracks are assigned to the second cluster. Robinson discloses a 
similar method and system for clustering songs and recommending the best song in the 
cluster to the user, Column 1 3 Lines 46 - 54. Robinson also teaches setting "the 
average number of songs desired per cluster," Column 4 Lines 65 - 67, similar to 
Obrador's teaching of fixing the number of media objects in a cluster. Robinson further 
explains, "As new songs are added to the system, new clusters are automatically 
created such that the average number of songs remains approximately the same; the 
optimization process then populates the cluster. These clusters, in various 
embodiments, may start out empty before they are optimized, or may be initially 
populated with new songs or randomly chosen songs," Column 4 Line 67 - Column 5 
Line 7. Robinson also teaches that a wide range of clustering approaches fall within the 
scope of the invention and gives provides source code for the standard k-means 
clustering concept as an example. To further support the technique of Robinson . 
Ferhatosmanoqlu teaches the "k-means algorithm [13] iteratively constructs a number of 
clusters with a representative for each cluster such that the error in representation is 
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minimized," Page 506 Column 2. Ferhatosmanoqlu like Obrador and Robinson also 
teaches the clustering algorithm limits "the size of each cluster from both above and 
below," Page 507 Column 1 . Ferhatosmanoqlu explains, "If the size goes above the 
upper threshold, the cluster is split into two. If the size goes down below the lower 
threshold, then the cluster centroid is erased from the list of centroids. To split a cluster, 
we first duplicate the cluster centroid, and then perturb the exact copies randomly. It is 
known that K-means algorithm is sensitive to initialization. Since we have this splitting 
mechanism, instead of starting from cluster centroids chosen by some pre-processing 
scheme, we start by a single cluster, and the algorithm automatically creates new 
clusters until the population of each cluster is below the threshold. As we will 
demonstrate later, by having a lower threshold for cluster size, several queries can be 
answered by retrieving only a very small number of clusters. Also, by limiting the cluster 
sizes from above, we avoid extremely unbalanced distribution of data over the clusters. 
Although the minimum and maximum cluster sizes are not dominant factors in the 
performance of our technique, reasonable values need to be set for the design 
purposes. Therefore, given the teachings of Robinson and Ferhatosmanoqlu . it would 
have been obvious to one of ordinary skill in the art at the time of the invention to use 
the k-means algorithm as suggested by Robinson and further explained by 
Ferhatosmanoqlu with limits placed on the size of the clusters when adding new songs 
to a collection in the invention of Obrador . Khan , and Liou, thereby realizing the 
aforementioned advantages while fixing the number of media objects in a cluster that 
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may be conveniently presented to a user at the same time (Obrador Column 10 Lines 
23-27). 

Response to Arguments 

12. Applicant's arguments filed December 14, 2009 have been fully considered but 
they are not persuasive. Applicant argues on page 1 1 of the remarks, "the present 
invention as claimed in claim 1 comprises automatic re-classification as a response to 
the detection of the addition of a new track; wherein further tracks that were previously 
classified in the neighbour cluster may also be classified in the new cluster, since they 
may be closer to the newly added track than their previous medoid," however Liu clearly 
shows "automatic re-classification" in Figure 11 beginning at step H and ending at step 
P. Further, claim 1 does not comprise the limitation of "wherein further tracks that were 
preciously classified in the neighbour cluster may also be classified in the new cluster," 
this limitation is found in newly amended claims 20 and 23 however as indicated in the 
rejection above this limitation is found in Liu Figure 1 1 step L. Also, per the 1 12 rejection 
the term "neighbour" is not supported by the specification as originally filed. 

While Applicant remarks that in Liou "it is not possible that a shot may be a 
member of more than one cluster" this limitation is not found in independent claim 1 . A 
similar limitation is found in claim 3 however the Examiner relied on Piatt for teaching 
this limitation. 

Further Applicant argues, "Liou teaches not to use K-means clustering," however 
the specific teaching of Liu at the column and lines indicted by Applicant states, "The 
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number of potential clusters a priori is not known, so known K-means clustering and 
other strategies using this a priori information are also not useful". Therefore Liou does 
not teach that you would not want to ever want to use K-means clustering only that K- 
means clustering techniques known to Liou at the time are not useful. Again it is noted 
that this limitation regarding K-means does not appear in the independent claims but in 
claim 24 in which the Examiner relied on Robinson and Ferhatosmanoglu for teaching a 
K-means algorithms that address the original concerns of Liou. 

Finally, in response to applicant's argument that the examiner's conclusion of 
obviousness is based upon improper hindsight reasoning, it must be recognized that 
any judgment on obviousness is in a sense necessarily a reconstruction based upon 
hindsight reasoning. But so long as it takes into account only knowledge which was 
within the level of ordinary skill at the time the claimed invention was made, and does 
not include knowledge gleaned only from the applicant's disclosure, such a 
reconstruction is proper. See In re McLaughlin, 443 F.2d 1392, 170 USPQ 209 (CCPA 
1971). Further, the Examiner has amended the wording in the rejection to clarify the 
motivation for the combination. 

Conclusion 

1 3. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Joseph Saunders whose telephone number is (571) 
270-1063. The examiner can normally be reached on Monday - Thursday, 9:00 a.m. - 
4:00 p.m., EST. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Vivian Chin can be reached on (571) 272-7848. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



/J. S./ 

Examiner, Art Unit 2614 
/Vivian Chin/ 

Supervisory Patent Examiner, Art Unit 2614 



